Szemeredi's Celebrated Theorem

One of the crowning achievements of combinatorics is 

Szemeredi's Theorem ([S]): Given an integer $n \ge 1$ and an integer $k \ge 3$, let $r_k(n)$ denote the
size of any largest subset $S$ of $[n] := \{1, 2, \ldots, n\}$ for which there are no subsets of the form

$$\{i, i+d, i+2d, \ldots, i+(k-1)d\} \quad (i \ge 1,\ 1 \le d < \infty);$$

then $r_k(n) = o(n)$.

The depth and mainstreamness of this remarkable theorem is amply supported by the fact that at 
least four Fields medalists (Klaus Roth, Jean Bourgain, Tim Gowers, and Terry Tao) and at least 
one Wolf prize winner (Hillel Furstenberg) made significant contributions. 

This article is yet another such contribution, and while it may not have the "depth" of the work 
of the above-mentioned human luminaries, it does have one advantage over them. We "cheat" and 
use a computer. It is true that, so far, we can only talk about finite analogs, but we do believe that 
the present approach could be eventually extended to sharpen the current rather weak bounds. 

More specifically, we prove: 

Finite version of Szemeredi's Theorem: Given an integer $n \ge 1$ and integers $k \ge 3$, $D \ge 1$,
let $R_{k,D}(n)$ denote the size of any largest subset $S$ of $[n] := \{1, 2, \ldots, n\}$ for which there are no
subsets of the form

$$\{i, i+d, i+2d, \ldots, i+(k-1)d\} \quad (i \ge 1,\ 1 \le d \le D);$$

then there exists a rational number $\alpha_{k,D} = P_{k,D}/Q_{k,D}$ such that

$$\lim_{n \to \infty} \frac{R_{k,D}(n)}{n} = \alpha_{k,D}.$$



We have (rigorously!) computed $\alpha_{k,D}$ for small $k$ and $D$ in the table below.
These numbers can get difficult to compute very quickly, but it can be seen, for example, that
$\alpha_{k,1} = \frac{k-1}{k}$. It turns out that even more is true: $R_{k,D}(n)$ is a quasi-linear function of $n$, and for
$i = 1, \ldots, Q_{k,D}$ there exist integers $a_{k,D,i}$ between $0$ and $P_{k,D} - 1$ such that

$$R_{k,D}([Q_{k,D}] \cdot n + i) = [P_{k,D}] \cdot n + a_{k,D,i}.$$



Our proof is algorithmic, and we show how to find these explicit expressions using rigorous
experimental mathematics.

Note that $\alpha_{k,D}$ is a non-increasing sequence in $D$, and Szemeredi's theorem is equivalent to the
statement that

$$\lim_{D \to \infty} \alpha_{k,D} = 0.$$

A Wordy Formulation 

Every subset $S$ of $[1,n] = \{1, 2, 3, \ldots, n\}$ corresponds to an $n$-letter word in the alphabet $\{0, 1\}$
defined by $w[i] = 1$ if and only if $i \in S$. $S$ has an arithmetical progression of size $k$ if there is an
Equidistant Letter Sequence, in the sense of the Bible Codes, of the word $1^k$ (i.e. 1 repeated $k$
times). Denoting by 2 a place where the occupying letter may be either 0 or 1, we can say that the
$r_k(n)$ of Szemeredi's theorem defined above asks to find the maximal number of 1's that an $n$-letter
word in $\{0, 1\}$ may have, that avoids the infinitely many patterns

$$(1\,2^d)^{k-1}\,1, \quad 0 \le d < \infty.$$
Analogously, the $R_{k,D}(n)$ of the finite-version Szemeredi's theorem defined above asks to find the
maximal number of 1's that an $n$-letter word in $\{0, 1\}$ may have, that avoids the finitely many
patterns

$$(1\,2^d)^{k-1}\,1, \quad 0 \le d \le D-1.$$

Define the weight of a word $w$ to be $t^{\mathrm{length}(w)} z^{\#1\text{'s in } w}$. Let $F_{k,D}(z,t)$ be the weight-enumerator of
all binary words avoiding the $D$ patterns $(1\,2^d)^{k-1}\,1$ $(0 \le d \le D-1)$. We will soon see that
$F_{k,D}(z,t)$ is a rational function in $(z, t)$.

Let's treat the more general case of an arbitrary set of generalized patterns. But let's first define
what a generalized pattern is.

Definition: A generalized pattern is a word in the alphabet {0, 1,2}, where 2 stands for "space". 
Now let's say what it means to contain a pattern. 

Definition: A word $w = w_1 w_2 \cdots w_n$ in the alphabet $\{0, 1\}$ contains the pattern $p = p_1 p_2 \cdots p_m$
if there exists a position $i$ ($1 \le i \le n - m + 1$) such that

$$w_{i+j-1} = p_j \quad \text{if } p_j \ne 2, \quad j = 1, \ldots, m.$$

For example, the word 011101101 contains the pattern 12221 (with $i = 3$).

A word w avoids a generalized pattern p if it does not contain it. A word w avoids a set of 
generalized patterns P if w avoids all the members of P. 
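
The two definitions above translate almost literally into code. The following is a minimal Python sketch, not the article's Maple implementation; the function names `contains`, `match_positions` and `avoids` are hypothetical, and words and patterns are assumed to be plain strings with `2` as the wildcard.

```python
def match_positions(word: str, pattern: str, wildcard: str = "2") -> list:
    """Return all 1-based positions i at which `word` contains `pattern`.

    Position i matches when word[i + j - 1] == pattern[j] for every j
    where pattern[j] is not the wildcard (1-based indices, as in the text).
    """
    n, m = len(word), len(pattern)
    hits = []
    for start in range(n - m + 1):  # 0-based start; the text's i is start + 1
        if all(p == wildcard or word[start + j] == p
               for j, p in enumerate(pattern)):
            hits.append(start + 1)
    return hits


def contains(word: str, pattern: str, wildcard: str = "2") -> bool:
    """True if `word` contains the generalized `pattern` somewhere."""
    return bool(match_positions(word, pattern, wildcard))


def avoids(word: str, patterns, wildcard: str = "2") -> bool:
    """True if `word` avoids every pattern in the collection `patterns`."""
    return not any(contains(word, p, wildcard) for p in patterns)


# The example from the text: i = 3 is one of the matching positions.
print(match_positions("011101101", "12221"))
```

Note that with this convention the empty pattern is contained in every word, including the empty word, since there is nothing to check at position 1.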

Analogous definitions can be made for an arbitrary finite alphabet, where we can use SPACE (_) 
instead of 2. We will now digress to that general scenario, and later specialize back to the binary 
case. 

The General Problem 

Consider a finite alphabet A together with a symbol SPACE (to be denoted by _) not in A. We
are interested in weight-enumerating the set of words that avoid a set of patterns P, according to 
the weight 

$$\mathrm{weight}(w_1 w_2 \cdots w_n) = x[w_1]\, x[w_2] \cdots x[w_n],$$

where $x[a]$ ($a \in A$) are commuting indeterminates. For example, $\mathrm{weight}(PAUL) = x[P]\,x[A]\,x[U]\,x[L] =
x[A]\,x[L]\,x[P]\,x[U]$, and $\mathrm{weight}(DORON) = x[D]\,x[N]\,x[O]^2\,x[R]$.

Let F be the weight-enumerator (sum of weights of its members, a formal power series in the
variables $\{x[a],\ a \in A\}$) of the set of such words (that avoid P); let's call this set, for reasons to become
clear shortly, $S[P, \emptyset]$. A word belonging to it is either empty, or else starts with one of the letters
of our alphabet. If you chop that letter, what remains is a shorter word in $S[P, \emptyset]$, but with more
conditions, since it cannot start with a "chopped pattern" obtained by chopping off the first letter
of any pattern of P that happens to start with that letter or with _.

This motivates the following 

Definition: Given a word or pattern $w = w_1 w_2 \cdots w_n$, let $\mathrm{BEHEAD}(w) := w_2 \cdots w_n$.

For example, BEHEAD(DORON) = ORON, BEHEAD(PAUL) = AUL, BEHEAD(__L_OVE) = _L_OVE.

Let P be a set of patterns, and let a be any letter of our alphabet A, then let 

$$P/a := \{\, \mathrm{BEHEAD}(p) \mid p \in P \text{ and } (p_1 = a \text{ or } p_1 = \_) \,\}.$$

For example, if the alphabet is {0, 1}, and 

P = {000, 0_0_0, 0__0__0, 111, 1_1_1, 1__1__1, __101} ,

then

P/0 = {00, _0_0, __0__0, _101} ,
P/1 = {11, _1_1, __1__1, _101} .
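
The BEHEAD and P/a operations are easy to sketch in Python. This is only an illustration under the assumption that patterns are strings with `_` as SPACE; the names `behead` and `quotient` are hypothetical, and the pattern set used below is a smaller one chosen for this sketch.

```python
def behead(p: str) -> str:
    """Drop the first letter of a word or pattern."""
    return p[1:]


def quotient(patterns, a: str, space: str = "_") -> frozenset:
    """Compute P/a: behead every pattern that starts with `a` or with SPACE."""
    return frozenset(behead(p) for p in patterns if p and p[0] in (a, space))


# A small illustrative pattern set over the alphabet {0, 1}.
P = {"000", "0_0_0", "111", "__101"}
print(sorted(quotient(P, "0")))  # patterns chopped by a leading 0
print(sorted(quotient(P, "1")))  # patterns chopped by a leading 1
```

Here `P/0` keeps the chopped versions of `000`, `0_0_0` and `__101`, while `P/1` keeps those of `111` and `__101`, because a pattern that starts with SPACE is compatible with either letter.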

So if w belongs to our set $S[P, \emptyset]$ and it starts with the letter a, say, then the chopped word
obviously also avoids P but in addition avoids P/a at the very beginning. This motivates us to
make yet another

Definition: Let P and P' be sets of patterns. The set $S[P, P']$ consists of all words avoiding the
patterns in P and in addition avoiding the patterns P' at the very beginning.

Since every word in $S[P, P']$ must be either empty or else begin with one of the letters of our
alphabet A, we have the linear equation, for the weight-enumerators $F[P, P'](\{x[a]\})$,

$$F[P, P'] = 1 + \sum_{a \in A} x[a]\, F[P,\ P/a \cup P'/a].$$

If P' contains an empty pattern, then of course we have the initial condition F[P,P'] = 0, since 
not even the empty word avoids the empty word as a factor. 

Of course, we only care about $F[P, \emptyset]$, but in order to compute it, we need to set up a system
of linear equations featuring lots of F[P,P'] with many other (unwanted!) P', but nevertheless 
finitely many of them. Since the different values of P' that show up on the right side always 
contain shorter patterns, and eventually we get P' that contain the empty pattern so that we can 
use the initial condition, we get finitely many (but possibly a very large number) of equations, and 
as many equations as unknowns. Also, since we know from the outset that a solution exists (from 
the combinatorics), it follows that the system of equations is non-singular, and by Cramer's rule 
that we have a rational function in the variables 

$$\{\, x[a] \mid a \in A \,\}.$$
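
As a small worked illustration (added here as a sanity check, not taken from the article), take the alphabet $\{0, 1\}$ and the single pattern $P = \{11\}$. Writing $\epsilon$ for the empty pattern, only two unknowns arise:

$$
\begin{aligned}
F[P, \emptyset] &= 1 + x[0]\, F[P, \emptyset] + x[1]\, F[P, \{1\}], \\
F[P, \{1\}] &= 1 + x[0]\, F[P, \emptyset] + x[1]\, F[P, \{1, \epsilon\}] = 1 + x[0]\, F[P, \emptyset],
\end{aligned}
$$

since $F[P, \{1, \epsilon\}] = 0$ by the initial condition. Eliminating $F[P, \{1\}]$ gives

$$F[P, \emptyset] = \frac{1 + x[1]}{1 - x[0] - x[0]\, x[1]}.$$

Setting $x[0] = x[1] = t$ yields $(1+t)/(1 - t - t^2) = 1 + 2t + 3t^2 + 5t^3 + \cdots$, the familiar Fibonacci count of binary words with no two consecutive 1's.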
Specializing



Going back to the Szemeredi scenario, we have a two-letter alphabet $\{0, 1\}$ with weights $x[0] = t$,
$x[1] = zt$. For any set of forbidden patterns, in particular the patterns that encode arithmetical
progressions of size $k$ with spacings $\le D$, the generating function is of the form

$$R(z,t) = \frac{P(z,t)}{Q(z,t)},$$

where $t$ keeps track of the length of words and $z$ keeps track of their number of 1's.
Expanding $R(z,t)$ as a power series in $t$, we get

$$R(z,t) = \sum_{n=0}^{\infty} r_n(z)\, t^n,$$

and $r_n(z)$ is a polynomial whose degree (in $z$) is the largest number of 1's in an $n$-letter word avoiding
the set of generalized patterns. By looking at the monomials of the denominator, $Q(z,t)$, and
searching for the monomial $z^i t^j$ with largest ratio $r := i/j$, we get that the largest number of 1's in
an $n$-letter word in $\{0, 1\}$ is asymptotically $nr$, and more precisely, we have the behavior described
above for $R_{k,D}(n)$, as a certain quasi-linear discrete function.
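
To see the ratio rule at work on the simplest case (a hand computation added for illustration, with $k = 3$ and $D = 1$, so the only forbidden pattern is 111): a word avoiding 111 is a sequence of blocks of at most two 1's separated by single 0's, which gives

$$F_{3,1}(z,t) = \frac{1 + zt + z^2 t^2}{1 - t\,(1 + zt + z^2 t^2)} = \frac{1 + zt + z^2 t^2}{1 - t - z t^2 - z^2 t^3}.$$

The monomials of the denominator are $t$, $z t^2$ and $z^2 t^3$, with ratios $i/j$ equal to $0$, $1/2$ and $2/3$. The largest ratio is $2/3$, in agreement with the obvious fact that at most two out of every three consecutive letters can be 1.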

An Experimental-Yet-Rigorous Shortcut

Solving a huge system of linear equations with symbolic coefficients is very time- and memory-consuming.
Restricting attention to the alphabet $\{0,1\}$, and letting $f(P,P')(n)$ be the maximum
number of 1's in an $n$-letter word that avoids the patterns in P and in addition, at the beginning,
the patterns in P', we get, for $n > 0$,

$$f(P,P')(n) = \max\{\, f(P,\ P/0 \cup P'/0)(n-1),\ f(P,\ P/1 \cup P'/1)(n-1) + 1 \,\}.$$
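
The max recurrence can be sketched in a few lines of Python. This is an illustrative sketch only, not the ENDRE package: all function names are hypothetical, patterns are assumed to be strings with `_` as the wildcard, and a state P' that contains the empty pattern is treated as having value minus infinity (no admissible word). A brute-force routine is included purely as a cross-check for small n.

```python
from itertools import product


def szemeredi_patterns(k: int, D: int) -> frozenset:
    """Patterns (1 _^d)^(k-1) 1 for 0 <= d <= D-1, with '_' as wildcard."""
    return frozenset(("1" + "_" * d) * (k - 1) + "1" for d in range(D))


def quotient(patterns, a: str) -> frozenset:
    """P/a: behead the patterns that start with `a` or with the wildcard."""
    return frozenset(p[1:] for p in patterns if p and p[0] in (a, "_"))


def build_scheme(P) -> dict:
    """Map every reachable live state P' to its (0-child, 1-child)."""
    scheme, todo = {}, [frozenset()]
    while todo:
        state = todo.pop()
        if state in scheme or "" in state:  # "" in state: dead state
            continue
        kids = tuple(quotient(P, a) | quotient(state, a) for a in "01")
        scheme[state] = kids
        todo.extend(kids)
    return scheme


def max_ones(P, N: int) -> list:
    """Return [f(P, {})(n) for n = 0..N] using the max recurrence."""
    scheme = build_scheme(P)
    dead = float("-inf")
    prev = {s: 0 for s in scheme}  # n = 0: the empty word has no 1's
    out = [0]
    for _ in range(N):
        prev = {s: max(prev.get(c0, dead), prev.get(c1, dead) + 1)
                for s, (c0, c1) in scheme.items()}
        out.append(prev[frozenset()])
    return out


def brute_force_max_ones(P, n: int) -> int:
    """Exhaustive cross-check: try all 2^n words (small n only)."""
    def contains(w, p):
        return any(all(c == "_" or w[i + j] == c for j, c in enumerate(p))
                   for i in range(len(w) - len(p) + 1))
    words = ("".join(t) for t in product("01", repeat=n))
    return max(w.count("1") for w in words
               if not any(contains(w, p) for p in P))


print(max_ones(szemeredi_patterns(3, 1), 9))
```

For k = 3 and D = 1 the only pattern is 111, and the output agrees with the formula n - floor(n/3), that is, density 2/3.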



(Remember that any word in $\{0,1\}^n$, not just the one with the largest number of ones avoiding P
and P', must start with either a 0 or a 1!) We ask the computer to first find the scheme, in terms
of a binary tree where the left-child of P' is $P/0 \cup P'/0$ and its right-child is $P/1 \cup P'/1$. Then we
ask the computer to crank out lots of data, say, the first 500,000 terms (or whatever is needed),
and then the computer automatically guesses explicit expressions of the form

$$R_{k,D}([Q_{k,D}] \cdot n + i) = [P_{k,D}] \cdot n + a_{k,D,i}, \quad i = 1, \ldots, Q_{k,D},$$

for certain integers $P_{k,D}$, $Q_{k,D}$, and $a_{k,D,i}$. Once guessed, the computer automatically gives a fully
rigorous proof, a posteriori, by checking all the above equations, this time symbolically. See the
sample output of ENDRE at the webpage of this article for an example.
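
The guessing step alone might look as follows. This is a hedged sketch, not the procedure used in ENDRE: the function name `guess_quasilinear` is hypothetical, it assumes the quasi-linear behavior already holds from the first term on, and it only guesses; the symbolic a posteriori proof described above is not reproduced here.

```python
def guess_quasilinear(seq, max_period=None):
    """Guess (Q, P, offsets) such that seq[Q*n + i] == P*n + offsets[i-1]
    for i = 1..Q and all n >= 0 covered by the data.

    `seq` is a list with seq[m] the m-th term (seq[0] is ignored).
    Returns None when no period up to `max_period` fits the data.
    """
    N = len(seq) - 1
    max_period = max_period or N // 3  # demand at least three periods of data
    for Q in range(1, max_period + 1):
        # The form holds iff seq[m + Q] - seq[m] is the same constant P
        # for every m >= 1 in the data.
        steps = {seq[m + Q] - seq[m] for m in range(1, N - Q + 1)}
        if len(steps) == 1:
            P = steps.pop()
            return Q, P, [seq[i] for i in range(1, Q + 1)]
    return None


# Illustrative data: the maximum number of 1's in words avoiding 111,
# which equals m - floor(m/3) for words of length m.
data = [m - m // 3 for m in range(31)]
print(guess_quasilinear(data))
```

On this data the smallest period that fits is Q = 3 with slope P = 2, so the guessed density is P/Q = 2/3.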

Supporting Software

All this is implemented in the Maple package ENDRE. A Mathematica program is also provided, but
only for the problems in the context of Szemeredi's Theorem. For efficiency's sake, a Java program is
also available. See the webpage http://www.math.rutgers.edu/~zeilberg/mamarim/mamarimhtml/szemeredi.html
for these packages, as well as sample input and output.

Exact Enumeration

From Sloane's point of view, it is interesting to crank out as many terms as possible of $R_{k,D}(n)$,
both for their own sake, and also because they offer upper bounds for $r_k(n)$. The interesting and
efficient methods of the recent paper [GGK], which treats $r_3(n)$, may be useful to output more terms
of $R_{k,D}(n)$ for larger $D$, but of course our focus is completely different. We do symbol-crunching
rather than number-crunching.

The entries from the above table for $\alpha_{k,D}$ imply upper bounds for $r_3(n), r_4(n), \ldots$.

The Maple package ENDRE also contains programs for the straight enumeration of words of length 
n avoiding a set of generalized patterns, and for computing generating functions, from which the 
exact asymptotics of the enumerating sequence can be easily determined. 

Finite Version of van der Waerden 

van der Waerden's theorem (for two colors) tells you that $W_k(n)$, the number of $n$-letter words in
the alphabet $\{0, 1\}$ that avoid the generalized patterns

$$(1\,2^d)^{k-1}\,1, \quad (0\,2^d)^{k-1}\,0, \quad (0 \le d < \infty)$$

is eventually 0. It is still of interest to investigate the finite version, $W_{k,D}(n)$, the number of $n$-letter
words in the alphabet $\{0, 1\}$ that avoid the generalized patterns

$$(1\,2^d)^{k-1}\,1, \quad (0\,2^d)^{k-1}\,0, \quad (0 \le d \le D-1).$$

The Maple package ENDRE can handle these problems as well.
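
For small cases the finite van der Waerden counts can be sketched with the same chopping idea, replacing "max" by "sum". The Python below is an illustration under stated assumptions, not the ENDRE code: the names `vdw_patterns` and `count_avoiders` are hypothetical, `_` plays the role of the wildcard 2, and a state containing the empty pattern contributes zero words.

```python
def vdw_patterns(k: int, D: int) -> frozenset:
    """Patterns (c _^d)^(k-1) c for c in {0, 1} and 0 <= d <= D-1."""
    return frozenset((c + "_" * d) * (k - 1) + c
                     for d in range(D) for c in "01")


def quotient(patterns, a: str) -> frozenset:
    """P/a: behead the patterns that start with `a` or with the wildcard."""
    return frozenset(p[1:] for p in patterns if p and p[0] in (a, "_"))


def count_avoiders(P, N: int) -> list:
    """Return the number of n-letter binary words avoiding P, for n = 0..N."""
    start = frozenset()
    scheme, todo = {}, [start]
    while todo:  # explore the reachable states P'
        state = todo.pop()
        if state in scheme or "" in state:  # "" in state: no word is allowed
            continue
        scheme[state] = [quotient(P, a) | quotient(state, a) for a in "01"]
        todo.extend(scheme[state])
    prev = {s: 1 for s in scheme}  # n = 0: only the empty word
    out = [1]
    for _ in range(N):
        prev = {s: sum(prev.get(c, 0) for c in kids)
                for s, kids in scheme.items()}
        out.append(prev[start])
    return out


# k = 3, D = 1: words with no 000 and no 111.
print(count_avoiders(vdw_patterns(3, 1), 6))
```

With k = 3 and D = 4 every difference that fits inside nine letters is forbidden, so the count for n = 9 drops to 0, which is the statement that the two-color van der Waerden number for 3-term progressions is 9.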

Pipe dreams

For a fixed $k$, $\alpha_{k,D}$ gets harder and harder to compute as $D$ gets larger and larger, but we believe
that a clever analysis of the max equations might lead, one day, to a quantitative understanding of
how $\alpha_{k,D}$ decreases with $D$, that may (who knows?) lead to an easier proof of Szemeredi's theorem,
and more importantly, improved lower bounds on $r_k(n)$.

What we are essentially doing is solving a system of recurrences of the form

$$f_i(n) = \max\{\, f_{a(i)}(n-1) + 1,\ f_{b(i)}(n-1) \,\},$$

for $N$ sequences $\{f_i(n)\}$, $i = 1, \ldots, N$. Here $a(i)$, $b(i)$ are some functions from $[1,N]$ to $[1,N]$. It
may be worthwhile to study such recurrences for their own sake, abstractly, and come up with a
study of the asymptotic density as it depends on $a(i)$, $b(i)$. It is not hard to show that $f_i(n)$ can
be modeled as

$$R(Qn + i) = Pn + c_i,$$

however, it is not necessarily true that $0 \le c_i < P$. Regardless, hopefully we can get some general
theorems, and since $a(i)$ and $b(i)$ are arbitrary, there is lots of elbow-room for induction.

Finally, we would check that the particular $a(i)$, $b(i)$ that show up satisfy some general conditions
that would enable us to get upper bounds on $\alpha_{k,D}$ as a function of $D$.